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Abstract. Is it possible to distinguish algebraic from transcendental real 
numbers by considering the fe-ary expansion in some base b ^ 2? In 1950, 
E. Borel suggested that the answer is no and that for any real irrational alge- 
braic number x and for any base g ^ 2, the g-ary expansion of x should satisfy 
some of the laws that are shared by almost all numbers. For instance, the 
frequency where a given finite sequence of digits occurs should depend only on 
the base and on the length of the sequence. 

We are very far from such a goal: there is no explicitly known example of 
a triple (g, a, x), where g ^ 3 is an integer, a a digit in {0, . . . , g — 1} and x 
a real irrational algebraic number, for which one can claim that the digit a 
occurs infinitely often in the g-ary expansion of x. 

Hence there is a huge gap between the established theory and the expected 
state of the art. However, some progress has been made recently, thanks mainly 
to clever use of Schmidt's subspace theorem. We review some of these results. 



1. Normal Numbers and Expansion of Fundamental Constants 

1.1. Borel and Normal Numbers. In two papers, the first pubHshed in 1909 
and the second 29J in 1950, Borel studied the g-ary expansion of real numbers, 
where 5 ^ 2 is a positive integer. In his second paper he suggested that this 
expansion for a real irrational algebraic number should satisfy some of the laws 
shared by almost all numbers, in the sense of Lebesgue measure. 

Let 5 ^ 2 be an integer. Any real number x has a unique expansion 

X = a-kg^ + . . . + a_ig + ao + aig^^ + 02,9^^ + . . . , 

where fc ^ is an integer and the for i — fc, namely the digits of x in the 
expansion in base g of x, belong to the set {0, 1, . . . , g — 1}. Uniqueness is subject 
to the condition that the sequence (ai)i^-k is not ultimately constant and equal to 
(7 — 1. We write this expansion 

X — a^k ■ ■ ■ a_iao.aia2 .... 

Example. We have 

V2 = 1.41421356237309504880168872420 . . . 

in base 10 {decimal expansion), whereas 

\/2 = 1.01101010000010011110011001100111111100111011110011 . . . 

in base 2 [binary expansion). 

The first question in this direction is whether each digit always occurs at least 
once. 

Conjecture 1.1. Let x he an real irrational algebraic number, g ^ S a positive 
integer and a an integer in the range ^ a ^ g — 1. Then the digit a occurs at 
least once in the g-ary expansion of x. 
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For 5 = 2 it is plain that each of the two digits and 1 occurs infinitely often 
in the binary expansion of an irrational algebraic number. The same is true for 
each of the sequences of two digits 01 and 10. Conjecture 11.11 implies that each 
of the sequences of digits 00 and 11 should also occur infinitely often in such an 
expansion; apply Conjecture II . II with 3 = 4. There is no explicitly known example 
of a triple (5, a, x), where g ^ 3 is an integer, a a digit in {0, . . . , g — 1} and x a 
real irrational algebraic number, for which one can claim that the digit a occurs 
infinitely often in the g-ary expansion of x. Another open problem is to produce 
an explicit pair {x,g), where 5 5^ 3 is an integer and x a real irrational algebraic 
number, for which we can claim that the number of digits which occur infinitely 
many times in the 5-ary expansion of x is at least 3. Even though few results are 
known and explicit examples are lacking, something is known. For any g ^ 2 and 
any fc ^ 1 there exist real algebraic numbers x such that any sequence of k digits 
occurs infinitely often in the g-ary expansion of x. However the connection with 
algebraicity is weak: indeed Mahler proved in 1973 more precisely that for any real 
irrational number a and any sequence of k digits in the set {0, ... ,5 — 1}, there 
exists an integer m for which the g-aiy expansion of ma contains infinitely many 
times the given sequence; see [3J Theorem M] and [SO]. According to Mahler, the 
smallest such m is bounded by (7^*^+^. This estimate has been improved by Berend 
and Boshernitzan 26\ to 2g'''^^ and one cannot get better than g'^ — 1. 

If a real number x satisfies Conjecture 1 1 . 1 1 for all g and a, then it follows that for 
any g, each given sequence of digits occurs infinitely often in the g-aiy expansion 
of X. This is easy to see by considering powers of g. 

Borel asked more precise questions on the frequency of occurrences of sequences 
of binary digits of real irrational algebraic numbers. We need to introduce some 
definitions. 

Firstly, a real number x is called simply normal in base g if each digit occurs 
with frequency 1/g in its (?-ary expansion. A very simple example in base 10 is 

X = 0.123456789012345678901234567890 . . . , 

where the sequence 1234567890 is repeated periodically, but this number is rational. 
We have 

_ 1234 567 890 _ 137174 210 
^ ~ 9 999 999 999 ~ 1 111 111 111' 
Secondly, a real number x is called normal in base g or g-normal if it is simply 
normal in base g™ for all m ^ 1. Hence a real number x is normal in base g if and 
only if, for all m ^5 1, each sequence of m digits occurs with frequency l/g™ in its 
g-avj expansion. 

Finally, a number is called normal if it is normal in all bases g ^ 2. 
Borel suggested in 1950 that each real irrational algebraic number should be 
normal. 

Conjecture 1.2 (Borel, 1950). Let x be a real irrational algebraic number and 
g ^ 2 a positive integer. Then x is normal in base g. 

As shown by Borel |28j , almost all numbers, in the sense of Lebesgue measure, are 
normal. Examples of computable normal numbers have been constructed by Sier- 
pinski, Lebesgue, Becher and Figueira; see ^Ij. However, the known algorithms to 
compute such examples are fairly complicated; indeed, "ridiculously exponential" , 
according to f2T . 

An example of a 2-normal number is the binary Champernowne number., obtained 
by concatenation of the sequence of integers 

0. 1 10 11 100 101 110 111 1000 1001 1010 1011 1100 1101 1110 . . . ; 
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see [351 [SH HI] ■ A closed formula for this number is 

k 



k^l 3=1 



see [21 page 183]. 

Another example is given by Korobov, Stoneham, and others: if a and g are 
coprime integers greater than 1, then 

is normal in base g; see [20) . 

A further example, due to Copeland and Erdos in 1946, of a normal number in 
base 10 is 

0.23571113171923..., 

obtained by concatenation of the sequence of prime numbers. 

As pointed out to the author by Tanguy Rivoal, a definition of what it means 
for a number to have a random sequence of digits is given in [33j . the first concrete 
example being Chaitin's omega number which is the halting probability of a universal 
self- delimiting computer with null-free data. This number, which is normal and 
transcendental, appears also to be a good candidate for being a non-period in the 
sense of Kontsevich and Zagier [33] . 



1.2. BBP Numbers. An interesting approach towards Conjecture [TT2l is provided 
by Hypothesis A of Bailey and Crandall [22] j who relate the question whether 
numbers like tt, log 2 and other constants are normal to the following hypothesis 
involving the behaviour of the orbits of a discrete dynamical system. 

Hypothesis A. Let 

gin) 

where g ^ 2 is an integer, R = p/q € Q{X) a rational function with q{n) ^ for 
n ^ 1 and degp < degq. Set ya — and 

yn+i ^ gyn + —rr mod 1. 

Then the sequence {yn)n^i either has finitely many limit points or is uniformly 
distributed modulo 1. 

A connection with special values of G-fimctions has been noted by Lagarias [H] . 
In his paper, Lagarias defines BBP numbers, with reference to the paper [21j by 
Bailey, Borwein and Plouffe, as numbers of the form 

-n 

^ qin) ^ ' 

where g 2 is an integer, p and q relatively prime polynomials in Z[A'] with 
q(n) 7^ for n ^ 1. 

Here are a few examples from [19) . Since 

log(l - a;) and — - — ^2n-i ^ 1 w ^ 



X 



n 



it follows that log 2 is a BBP number in base 2 as well as in base 9, since 
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Also log 3 is a BBP number in base 4, since 



log3 = 21og2 + log| = ;^^^ 

n>0 



y ( — ^ — ] 2-'". (2) 



and TT is a BBP number in base 16, since 

^ ^ ^ \ 8n + 1 8n + 4 8n + 5 8n + 6 

Further examples are tt^ in base 64 and in base 81, (log 2)^ in base 64, and C(3) in 
base 2^2 = 4096. 

1.3. Number of I's in the Binary Expansion of a Real Number. Denote 
by B{x,n) the number of I's among the first n binary digits of an irrational real 
number x. 

If X, y and x + y are positive and irrational, then for all sufficiently large n, 
B{x + y,n) ^ B{x, n) + B{y, n) + 1. 
If X, y and xy are positive and irrational, then for all sufficiently large n, 

B{xy, n) ^ B{x, n)B{y, n) + logs [x + y + l\. 
If X is positive and irrational, then for each integer A > 0, the bound 





log2 (^x H 


A 














X 





holds for all sufficiently large n; see ^01 [55] . 

A consequence is that if a and b are two integers, both at least 2, then none of 
the powers of the transcendental number 

is simply normal in base 2. Also the lower bound 

n) ^ n 

can be deduced [SS] . In [201 Theorem 7.1], Bailey, Borwein, Crandall and Pomerance 
have obtained a similar lower bound valid for all real irrational algebraic numbers. 

Theorem 1.3 (Bailey, Borwein, Crandall, and Pomerance, 2004). Let x be a real 
algebraic number of degree at least 2. Then there is a positive number C , which 
depends only on x, such that the number of 1 's among the first N digits in the 
binary expansion of x is at least CN^^'^. 

Further results related to Theorem 11.31 are given by Rivoal [SS] , Bugeaud [3T] , 
and Bugeaud and Evertse [52] . 

As pointed out by Bailey, Borwein, Crandall and Pomerance, it follows from 
Theorem 11.31 that for each d ^ 2, the number 

is transcendental. The transcendence of the number 

E 2-^" (3) 

goes back to Kempner in 1916; see [151 Section 13.10]. Other proofs are available, 
which rest either on Mahler's method, to be discussed in Section 13.11 or on the 
approximation theorem of Thue-Siegel-Roth-Ridout, to be discussed in Section 31 
see also [HH Section 1.6]. and [57] 1551 [T]. 
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Another consequence of Theorem 11.31 is the transcendence of the number 

for any integer <? ^ 2, where {Fn)n^o is the Fibonacci sequence, with Fq = 0, Fi = 1 
and Fn+i = F„ + F„_i for n ^ 1. Again, the transcendence of this number also 
follows from Mahler's method [5D] as well as from the theorem of Thue-Siegel-Roth- 
Ridout [S3 [HIT]. 



2. Words and Automata 
2.1. Words. We recall some basic facts from language theory; see for instance 

We consider an alphabet A with g letters. The free monoid A* on A is the set 
of finite words ai . . . a„ where n ^ and e A for I ^ i ^ n. The law on A* is 
called concatenation. 

The number of letters of a finite word is its length: the length of ai . . . is n. 

The number of words of length n is for n ^ 0. The single word of length is 
the empty word e with no letter. It is the neutral element for the concatenation. 

We shall consider infinite words w = a\a2a-i .... A factor of length m of such a 
word u; is a word of the form OkOk+i ■ ■ ■ ak+m-i for some k ^ 1. 

The complexity of an infinite word w is the function p(m) which counts, for each 
m ^ 1, the number of distinct factors of w of length m. Hence for an alphabet 
A with g elements we have 1 ^ p{m) ^ f/™, and the function m t—^ p{m) is non- 
decreasing. Conjecture 1 1.1 1 is equivalent to the assertion that the complexity of the 
sequence of digits in base g of an irrational algebraic number should be p{m) = g"^. 

An infinite word is periodic if and only if its complexity is bounded. If the 
complexity p{m) of a word satisfies p{m + 1) p(m) for one value of m, then 
p{m + k) = p{m) for all fc ^ 0, hence the word is periodic. It follows that the 
complexity of a non-periodic word satisfies p(rn) ^ m + 1. Following Morse and 
Hedlund, a word of minimal complexity p{m) = m + 1 is called a Sturmian word. 
Sturmian words are those which encode with two letters the orbits of square billiard 
starting with an irrational angle; see [16l [171 EZ] • It is easy to check that on the 
alphabet {a, 6}, a Sturmian word w is characterized by the property that for each 
m ^ 1, there is exactly one factor w of w of length m for which both va and vb are 
factors of w of length m + 1. 

Let A and B be two finite sets. A map from A to B* can be uniquely extended 
to a homomorphism between the free monoids A* and B* . We call such a homo- 
morphism a morphism from A to B. The morphism is uniform if all words in the 
image of A have the same length. 

Let (/? be a morphism from A into itself. Assume that there exists a letter a 
for which (p{a) = au, where u is a non-empty word satisfying ip''{u) ^ e for every 
fc ^ 0. Then the sequence of finite words {Lp''{a))k^i converges in A^ , endowed 
with the product topology of the discrete topology on each copy of A, to an infinite 
word w = au(p{u)(p'^ {u)ip^ (u) .... This infinite word is clearly a fixed point for ip 
and we say that w is generated by the morphism ip. 

If, moreover, every letter occurring in w occurs at least twice, then we say that 
w is generated by a recurrent morphism. 

If the alphabet A has two letters, then we say that w is generated by a binary 
morphism. 

More generally, an infinite sequence w in A^ is said to be morphic (respec- 
tively uniformly morphic) if there exist a sequence u generated by a morphism 
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(respectively a uniform morphism) defined over an alphabet B and a morphism 
(respectively a uniform morphism) tp from B to A such that w — ip{u). 

2.2. Finite Automata and Automatic Sequences. A formal definition of a 
finite automaton is given, for instance, in [THl Section 4.1], [351 Section 1.3.2], [T^l 
Section 3.3] and [SJ Section 3]. We do not give the exact definition but we propose 
a number of examples in Section 12.31 It suffices to say that a finite automaton 
consists of the following elements: 

• the input alphabet, which is usually the set of g ^ 2 digits {0, 1, . . . , g — 1}; 

• the set of states Q, usually a finite set of 2 or more elements, with one 
element, called the initial state and denoted by i, singled out; the elements 
of Q will be denoted by letters {i, a,b, . . .}; 

• the transition map Qx {0, 1, . . . , <?— 1} —^Q'.{a,n)i—f n[a] which associates 
to every state a a new state n[a] depending on the current input n; 

• the output alphabet A, together with the output map Q ^ A. 

We extend the transition map to a map (a,n) i— > n[a\ from Q x N — > Q as 
follows: let (a,n) G Q x N, replace n by its g-aiy expansion n — ekCk-i ■ ■ .ciCq 
and define inductively n[a] = CkCk-i ■ ■ ■ ei[6], where b = eo[a] is the image of (a, eg) 
under the transition map. Hence an automaton produces an infinite sequence 

0H,lW,2H,3W,...,nH,... 

of elements of Q. We take the images of the elements of this sequence under the 
transition map, and this gives rise to the output sequence. Such a sequence is called 
g-automatic. In other words, for an integer g ^ 2, an infinite sequence (a„)„^o of 
elements of {0, 1, . . . , (7 — 1} is said to be g-automatic if a„ is a finite-state function 
of the representation of n in base g. 

For instance let us explain why the characteristic function of the sequence 2^ = 1, 
2"'^ = 2, 2^ = 4, ... of powers of 2 is 2-automatic. Take g — 2, Q = {i, a, b}, and 
define the transition map by 

0[i] = i, 0[a] = a, 0[b] = b, 
l\i]=a, l[a]=b, l[b] = b. 

A convenient notation is given by the following diagram. 



000 




1 



One easily checks, on writing the sequence of positive integers in base 2, that 
0[i\^i, l[i]^a, 10[i]=a, 11^=6, 
100[i\ = a, 101[i]=5, 110^ = 6, .... 

Next define /(i) = 0, /(a) = 1 and f{b) = 0. The output sequence 
aoaia2 ... = 01101000100000001000 . . . 

is given by 

r 1 if n is a power of 2, 
" [0 otherwise. 
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According to Cobham [35], automatic sequences have complexity p{m) — 0{m); 
see also \\3\ Section 10.3]. Also, automatic sequences are the same as uniform 
morphic sequences; see for instance |151 Section 6.3] and [H Theorem 4.1]. 

Automatic sequences are between periodicity and chaos. They occur in con- 
nection with harmonic analysis, ergodic theory, fractals, Feigenbaum trees, quasi- 
crystals, transition phases in statistical mechanics, and others; see [TU [Ml [T^ and 
[la Chapter 17]. 

2.3. Examples. 

2.3.1. The Fibonacci Word. Consider the alphabet A — {a,b}. Start with /i = b, 
/2 a and define /„ = /„_i/„_2. Then 

/a = ab, /4 = aba, /s — abaab, fe = abaababa, fr = abaababaabaab, .... 

There is a unique word 

w ~ abaababaabaababaababaabaababaabaab . . . 

in which /„ is the prefix of length Fn, the Fibonacci number of index n, for n ^ 2. 
This is the Fibonacci word; it is generated by a binary recurrent morphism [151 
Section 7.1] and is the fixed point of the morphism a i— > a6, b t—^ a; under this 
morphism, the image of /„ is fn+i- 

Let us check that the Fibonacci word is not periodic. Indeed, the word /„ has 
length Fn, and consists of F„_i letters a and Fn-2 letters b. Hence the proportion 
of a in the Fibonacci word w is l/<&, where $ is the golden number 

^ 1 + ^/5 

which is an irrational number. On the other hand, for a periodic word the propor- 
tion of each letter is a rational number. 

Remark. The proportion of 5 in w is l/(f>^, with l/$ + l/$^ = 1, as expected! 

Proposition 2.1. The Fibonacci word is Sturmian. 

We write the factors of length 1, 2, 3, 4, 5, . . . of the Fibonacci word in columns 
as shown. 

aabaa 

a — > aa — > aab — > aaba — > aabab 

^ ab —> aba — > abaa —> abaab 

^ abab ababa 
b ^ ba baa baab — > baaba 

^ bab — baba — > babaa 
The fc-th column contains -I- 1 elements: one of them has two right extensions 
to a factor of length fc -|- 1, and all remaining k factors of length k have a single 
extension. This is easily seen to be a characterization of Sturmian words. 
Concerning the sequence 

Mn^o = (0, 1, 0, 0, 1, 0, 1, 0, 0, 1, 0, 0, 1,0,.. .), (5) 

derived from the Fibonacci word on the alphabet {0, 1}, a result of Danilov [41] in 
1972 states that for all integers g ^ 2, the number 

n^O 

is transcendental; see also [HI Theorem 4.2]. 
Proposition 2.2. The Fibonacci word is not automatic. 
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Proposition 12.21 follows from a result of Cobham [35) which shows that the fre- 
quency of a letter in an automatic word, if it exists, is a rational number. 

The origin of the Fibonacci sequence is a model of growth of a population of 
rabbits. Denote a pair of young rabbits by Y and a pair of adults by A. From one 
year to the next, the young pair become adult, which we write as Y A, while the 
adult pair stay alive and produce a young pair, which we write as ^ — > AY . This 
gives rise to the dynamical system 

Y A-^ AY AY A ^ AY A AY -> AY A AY AY A ^ . . . , 

and the sequence {Ri, R2, R3, . . .) of Fibonacci rabbits 

A, Y, A, A, Y, A, Y, A, A, Y, A, A, Y, A, ... . 

Replacing F by 1 and A by produces the sequence (O considered by Danilov. 

Any integer n ^ 2 has a unique representation as the sum of distinct Fibonacci 
numbers Fm, m ^ 2, with the property that no Fibonacci numbers with consecutive 
indices occur in the sum. This representation yields the following algorithm to 
decide whether i?„ is A or Y. If the smallest index in the decomposition of n is even, 
then Rn = A. If it is odd, then i?„ = Y. For instance, for 51 = i^g + F7 + F4 + F2, 
the smallest index, namely 2, is even, so R51 = A. 

Recall that <I> = (1 + \/5)/2 denotes the golden number. The sequence of indices 
n for which i?„ — A is 

L$J - 1, L2$J =3, L3*J =4, L4$J =6, 

while the sequence of indices n for which i?„ = y is 

[$2J = 2, [2<I>2J = 5, [3$2J = 7, [4$2J ^8, .... 

For instance, 32$ = 51.77. . ., so [32<I>J = 51 and R51 = A. 

This sequence (i?„)„^i of rabbits is an example of a Beatty sequence. 

2.3.2. The Prouhet-Thue-Morse word abbabaabbaababbab .... The finite automaton 






with f{i) = and /(a) — 1 produces the sequence 090102 . . . , where a„ = f{n[i]). 
For instance, with n = 9, since 9 is 1001 in binary notation, 

lOOlH = 100[a] = 10[a] = l[a] = i, 

and f{i) = 0, we have og = 0. 

This is the Prouhet-Thue-Morse sequence 

01101001100101101..., 

where the (n + l)-th term a„ is 1 if the number of I's, which is the same as the 
sum of the binary digits, in the binary expansion of n is odd, and is otherwise; 
see [13 Section 1.6]. 

If, in the Prouhet-Thue-Morse sequence, we replace by a and 1 by 6, we obtain 
the Prouhet-Thue-Morse word on the alphabet {a, 6}, and this starts with 

w — abbabaabbaababbab .... 

This word is generated by a binary recurrent morphism; see [151 Section 6.2]. It is 
the fixed point of the morphism a ab, b ^ ha. 
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An interesting observation, due to Thue in 1906, is that if w is a finite word and 
a a letter for which wwa is a factor of the Prouhct-Thue-Morse word, then a is 
not the first letter of w. Therefore, in the Prouhet-Thue-Morse sequence, no three 
consecutive identical blocks such as 000 or 111 or 010101 or 101010 or 001001001 
can occur. 

2.3.3. The Baum-Sweet Sequence. For n ^ 0, let a„ = 1 if the binary expansion of 
n contains no block of consecutive O's of odd length, and a„ = otherwise. The 
Baum-Sweet sequence (a„)„^o starts with 

110110010100100110010.... 

This sequence is automatic, associated with the automaton 




with f{i) = 1, /(a) = and f{b) = 0; see [HI Example 5.1.7]. 

2.3.4. Powers of 2. As we have seen, the binary number 

^■■=Y^ 2-2" ^ 0.1101000100000001000 . . . = O.aiazaa . . . 

is 2-automatic. The associated infinite word 

v = V1V2V3 . . . — bbabaaabaaaaaaabaaa . . . , 

where 

( b if rt is a power of 2, 
" y a otherwise, 

has complexity p^m) bounded by 2m; the initial values are given below. 

m 1 2 3 4 5 6 ... 
p{m) 2 4 6 7 9 11 ... 

2.3.5. The Rudin- Shapiro Word. For n ^ 0, let r„ G {a, 5} satisfy r„ = a (re- 
spectively r„ = b) if the number of occurrences of the pattern 11 in the binary 
representation of n is even (respectively odd). This produces the Rudin- Shapiro 
word 

aaabaabaaaabbbab .... 

Let (T be the morphism defined from the monoid B* on the alphabet B = 
{1, 2, 3, 4} into B* by cr(l) = 12, cr(2) = 13, cr(3) = 42 and cr(4) = 43. Let 

u= 121312421213... 

be the fixed point of a beginning with 1 and let be the morphism defined from 
B* to {a, 6}* by = aa, ip{2) = ab, ip{3) = ba and ip{A) = bb. Then the 

Rudin-Shapiro word is (/?(u), hence it is morphic. 
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2.3.6. Paper Folding. Folding a strip of paper always in the same direction, and 
then opening it up, yields a sequence of folds which can be encoded using the digits 
and 1. The resulting sequence (u„)n^o, given by 

1101100111001001..., 

satisfies 

U4n — 1, U4n+2 — 0, U2„+l = Un, 

and is produced by the automaton 












with f{i) = f{a) = f{b) = 1 and /(c) = 0. 

An equivalent definition for this sequence is given as follows: the sequence a„ = 
Un+i, n ^ 1, is defined recursively by a„ = 1 if n is a power of 2, and 

(^2''+a = 1 ^ «2'=-ai 1 < a < 2'^; 

see [ini Example 5.1.6]. 

For a connection between the paper folding sequence and the Prouhet-Thue- 
Morse sequence, see jTS] . 

2.4. Complexity of the g-ary Expansion of an Algebraic Number. The 

transcendence of a number whose sequence of digits is Sturmian has been proved 
by Ferenczi and Mauduit 42J in 1997. The point is that such sequences contain 
sequences of digits which bear similarities, and yields the existence of very sharp 
rational approximations which do not exist for algebraic numbers. 

It follows from their work that the complexity of the g-ary expansion of every 
irrational algebraic number satisfies 

liminf (p(to) — m) = +oo; 

m — »oo 

see [TU]. The main tool for the proof is a p-adic version of the Thue-Siegel-Roth 
theorem due to Ridout in 1957; see Theorem 14.31 below as well as [3]. 

Several papers have been devoted to the study of the complexity of the g-aiy 
expansions of real algebraic numbers, in particular by AUouche and Zamboni in 
1998, Risley and Zamboni in 2000, and Adamczewski and Cassaigne in 2003. For 
a survey, see 1^. The main recent result is the following [S]. 

Theorem 2.3 (Adamczewski and Bugeaud, 2007). The complexity p{m) of the 
g-ary expansion of a real irrational algebraic number satisfies 

limmi = +00. 

m — ^oo rn 
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In 196^, Cobham PSI claimed that automatic irrational numbers are transcen- 
dental. This follows from Theorem l2.31 since automatic numbers have a complexity 
0{m). 

Corollary 2.4 (Conjecture of Cobham). // the sequence of digits of an irrational 
real number x is automatic, then x is transcendental. 

The main tool for the proof of Theorem l2.3l is a new, combinatorial transcendence 
criterion obtained by Adamczewski, Bugeaud and Luca [7] as an application of 
Schmidt's subspace theorem; see Theorem 14.41 and [60) . 

In 1979, Christol |36| proved that when p is a prime and 

f{X) = Y,UkX'' ^Fp[[X]] 

k^l 

a power series with coefficients in the finite field Fp, then / is algebraic if and only 
if the sequence of coefficients (ufe)fe^i can be produced by a p-automaton. 

According to Cobham, a sequence can be simultaneously fc-automatic and £- 
automatic with k and £ multiplicatively independent if and only if it is ultimately 
periodic. Christol, Kamae, Mendes France and Rauzy P? have shown that a power 
series with coefficients in a finite set can be algebraic over two different finite fields 
if and only if it is rational. 

In this context Theorem 12.31 implies the following statement. 

Corollary 2.5. Let p be a prime number, g ^ p an integer and (ufe)fe^i a sequence 
of integers in the range {0, . . . ,p — 1}. The formal power series 

k^l 

and the real number 

^Ukg-'' 

k^l 

are simultaneously algebraic over Fp{X) and over Q respectively if and only if they 
are rational. 

As an example, taken from [TTl Section 6] and [12 Section 2.4]), consider the 
Prouhet-Thue-Morse sequence (a„)„^o- The series 

F(X) = ^a„X" 

is algebraic over F2{X), as it is a root of {1 + X)^F'^ + {1+X)'^F+X = 0. Since F is 
not a rational function, Corollarv l2 . 51 gives another proof of Mahler's transcendence 
result on the number 

The following result related to Theorem 12.31 has been obtained subsequently by 
Bugeaud and Evertse [32] by means of a refinement of the so-called Cugiani-Mahler 
theorem. The main tool is again a quantitative version of Schmidt's subspace 
theorem 

Theorem 2.6 (Bugeaud and Evertse, 2007). Let b ^ 2 be an integer and ^ an 
irrational algebraic number with < ^ < 1. Then for any real number rj < 1/11, 
the complexity p(rn) of the b-ary expansion of ^ satisfies 

p{m) 

limsup — — ^ +00. 

m^+oo m(logTO)'' 



The author is grateful to Boris Adamczewski for this reference. 
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Further developments of the method of [7', 'S] have recently been achieved by 
Adamczewski and Rampersad [9 who have shown that the binary expansion of an 
algebraic number contains infinitely many occurrences of 7/3-powers. Here, for a 
positive real number w, a w-th power is a word of the form M^L^^Jty, where W is 
a word of length \W\, say, and W is the prefix of W of length \{w — lw\)\W\']. 
For instance with w — 7/3 and W = 011010 we have \W\ — 6, [wj = 2 and 
\{w- N)|VFn = 2, so 011010 011010 01 is a 7/3-th power. 

Adamczewski and Rampersad deduce from their result that the binary expansion 
of an algebraic number contains infinitely many occurrences of overlaps, where an 
overlap is a pattern of the form xXxXx, where a; is a letter and X a word. 

3. Analytic Methods 

3.1. Mahler's Method. Mahler i48j initiated a new transcendence method in 
1929 for studying values of functions satisfying certain functional equations. His 
work was not widely known and in 1969 he published |49j which was the source of 
a revival of this method; see [47l [52] . 

A first example [SH Theorem 1.1.2] is the function 

which satisfies the functional equation/(z'^) + z ~ f{z) for \z\ < 1. Mahler proved 
that it takes transcendental values at algebraic points in the domain < |z| < 1. 
A special case is ([3]). 

Mahler also proved in 1929 that the so-called Prouhet-Thue- Morse- Mahler num- 
ber in base g ^ 2, given by 

where(a„)„^o is the Prouhet-Thue-Morse sequence, is transcendental; see ^2] and 
[m Section 13.4]. The idea of proof is as follows; see [13 Section 13.4] and [SH 
Example 1.3.1], where the complete proof is given. Consider the function 

/(z) = " -^^"^ '^^^'^^ satisfies /(z) = ^(-l)""z". 

ri^O n^O 

For a e {0, 1}, we can write (—1)° = 1 — 2a. Hence 

/(z) = ^(1 - 2a„)z" = ^ - 2 ar^z-. 

Using the functional equation /(z) = (1 — z)f{z'^), Mahler proves that f{a) is 
transcendental for all algebraic numbers a satisfying < \a\ < 1. 

Another application ,20j of Mahler's method is the transcendence of the number 
^ whose digits in a given base g are 1 at the Fibonacci indices 1,2,3,5,8,..., and 
elsewhere. 

After Cobham 39J, Loxton and van der Poorten also triecH in 1982 and 1988 
to use Mahler's method to prove Cobham's conjecture, now Corollary 12.41 of The- 
orem 12.31 of Adamczewski and Bugeaud. Becker [25 pointed out in 1994 that 
Mahler's method yields only a weaker result so far, that for any given non-eventually 
periodic automatic sequence u = (ui, 1*2,1*3 . . .), the real number 



^See [47] and [15] Section 13.10]. 
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is transcendental, provided that the integer g is sufficiently large in terms of u; see 
also [S]. It is yet a challenge to extend Mahler's method in order to prove Cobham's 
conjecture. 

There is a further very interesting development of Mahler's method which we 
only allude to here without entering the subject. It is due to Denis and deals 
with transcendence problems in finite characteristic; see Pellarin's report : 5Si] in 
the Bourbaki Seminar. 

3.2. Nesterenko's Theorem and Consequences. The transcendence of the 
Liouville-Fredholm number ^ below was studied only ten years ago by Bertrand 
in 1997, and Duverney, Nishioka as well as Nishioka and Shiokawa in 1998, as a 
consequence of the results due to Nesterenko's work in 1996 on the transcendence 
of values of theta series at rational points involving modular functions; see [511 
Chapter 3, Section 1.3]. It follows from these works that for algebraic a in the 
domain < jal < 1, the three numbers 

n^O n^sO n^O 

are algebraically independent. 

A related example from ^ is the number 

77 = ^7/^3-'= 

which is associated to the word 

u = 01242^12^12*12^12^12^ . . . , 

where, for instance, 2'* denotes 2222, generated by the non-recurrent morphism 
^ 012, 1 12, 2 2. Using 

^=l_^3-n(«+l)/2^ 

one deduces from Nesterenko's result that it is transcendental. For this number the 
growth of complexity p{m) is quadratic in m, so Theorem 12.31 does not apply. 

4. DioPHANTiNE Approximation 

4.1. Liouville, Thue, Siegel, Roth, Ridout, Schmidt. Diophantine approxi- 
mation theory yields information on the arithmetic nature of numbers of the form 

For instance, if the sequence satisfies 

Un+l — Un +00, 71 

then the number given by ^ is irrational. This follows from the fact that a real 
number is rational if and only if its g-aiy expansion is ultimately periodic. It also 
follows from Diophantine analysis. As a special case take u„ — n?. The irrationality 
of the Liouville-Fredholm number 

^ = E5""' (7) 

follows. 
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Here is the very simple proof, which goes back to Liouville in 1851; see [511 
Chapter 3, Section 1.3]. Let e > 0. Let be a sufhciently large integer. Set 

N 

q = and p = 



n=0 



Then 

For N sufficiently large the right hand side is less than e and the irrationality of 9 
is proved. 

That 6 is not a quadratic irrationality follows from the work of Bailey and Cran- 
dall [55]. See Section [221 for the transcendence of this number 9. 

The first transcendence statement on numbers given by a series ©, assuming 
that the sequence (un)n^o is increasing and grows sufficiently fast, goes back to 
Liouville in 1844. 

Theorem 4.1 (Liouville, 1844). For any real algebraic number a, there exists a 
constant c > such that the set ofp/q G Q with \a — p/q\ < q~'^ is finite. 

Liouville's theorem yields the transcendence of the value of a series like ©, 
provided that the increasing sequence (un)n^o satisfies 

limsup = +00. 

n. — >r>o 



For instance, u„ — n\ satisfies this condition, so the number 
is transcendental. 

Theorem 4.2 (Thue, Siegel, Roth [STlllH], 1955). For any real algebraic number 
a and any e > 0, the set ofp/q £ Q with \a — p/q\ < q^^^'^ is finite. 

From Theorem 14. 2[ one deduces the transcendence of the series ^ under the 
weaker hypothesis 

limsup > 2. 

n — >oo 'ayi 

The sequence m„ = [k"J satisfies this condition as soon as k > 2. For example, 
the transcendence of the number 

n^O 

follows from Theorem 14.21 

A stronger result follows from Theorem l4.3l below due to Ridout (see, for instance, 
[m Section 13.5]), using the fact that the denominators g"" are powers of g. The 
condition 

r """+1 ^ 1 
iim sup > 1 

n — *oo 'ayi 

is sufficient to imply the transcendence of the sum of the series see [T]. An 
example is the transcendence of 

see also ([3]) above. 



Theorem 4.3 (Ridout, 1957). For any g ^ 2, any real algebraic number a and 
any e > 0, the set ofp/qE Q with q = and \a — p/q\ < q^^^^ is finite. 
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The theorems of Thue-Siegel-Roth and Ridout are very special cases of Schmidt's 
subspace theorem [HD], estabhshed in 1972, together with its p-adic extension by 
Schhckewei in 1976. We state only a simplified version; see Bilu's Bourbaki lecture 
[27] for references and further recent achievements based on this fundamental result. 

For X = (xo, . . ■,x„^-l) £ Z™, let |x| = max{|a;o|, . . . , |a;m_i|}. 

Theorem 4.4 (Schmidt's subspace theorem). Let m ^ 2 be an integer, S a finite 
set of places of Q containing the infinite place. For each v € S , let io,i;, • ■ • , im-i,u 
be m independent linear forms in m variables with algebraic coefficients in the 
completion of Q at v. Let e > 0. Then the set of "x. = [xq, . . . , Xm-i) G Z™ for 
which 

^m — 1,1) 

(x)|„ ^ |x| ^ 

is contained in the union of finitely many proper subspaces of Q™ . 

Theorem 14.21 due to Thue-Siegel-Roth follows from Theorem 14.41 if one takes 
5 = {oo}, m = 2, L(3{x(3,xi) = xo, Li{xt^,xi) ^ ax^ - xi. 
Theorem 14.31 due to Ridout is also a consequence of Theorem 14.41 if one takes 
S = {oo} \J{1:1 prime and I \ g) 

and 

-^^o,oo(a;o,a:^i) = xa, Li^ao{xQ,xi) = ax^ - xi, 
Lo4(xo,xi) ^ xq, Li^i{xo,xi) ^ xi. 
Since there is no loss of generality to assume that g is squarefree, so that 

g-W, 

for (a;o, xi) = [ItP) with q ^ g^ , we have 

|io,oo(a;o,a:i)|oo = Q, |ii,oc,(a;o, a;i)|oo = \qa - p\, 

\LQ^e{xQ,xi)\i = Y\_ \Li^i{xQ,xi)\e = \p\i ^ 1. 

i\9 i\g 
Further applications of the Subspace theorem to transcendence questions have 
been obtained by Corvaja and Zannier |40j . 

4.2. Irrationality and Transcendence Measures. The previous results can be 
made effective in order to reach irrationality measures or transcendence measures 
for automatic numbers. 

In 2006, Adamczewski and Cassaigne [5] solved a conjecture of Shallit in 1999 by 
proving that the sequence of g-ary digits of a Liouville number cannot be generated 
by a finite automaton. They obtained irrationality measures for automatic numbers. 
Recall that the irrationality exponent of an irrational real number x is the least 
upper bound of the set of numbers k for which the inequality 

P ^ 1 
X < — 

q 9" 

has infinitely many solutions p/q. 

For instance, a Liouville number is a number whose irrationality exponent is 
infinite, while a real irrational algebraic number has irrationality exponent 2 by 
Theorem 14.21 a-s do almost all real numbers. 

An explicit upper bound for the irrationality exponent for automatic irrational 
numbers is given by ^ Theorem 2.2]. For the Prouhet-Thue-Morse-Mahlcr numbers 
for instance, the exponent of irrationality is at most 5. However there is no uniform 
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upper bound for such exponents, as pointed out to the author by Adamczewski. 
The irrationahty exponent for the automatic number associated with cr(0) = 0"1 
and (7(1) = 1"0 is at least n. 

Recently Adamczewski and Bugeaud (6l have obtained transcendence measures 
for automatic numbers. They show that automatic irrational numbers are either S- 
or T- numbers in Mahler's classification of transcendental numbers; see [30] ■ This 
is a partial answer to a conjecture of Becker [S] which states that all automatic 
irrational numbers are S'-numbers. 

5. Continued Fractions 

We discussed above some Diophantine problems which are related with the g- 
ary expansion of a real number. Similar questions arise with the continued fraction 
expansion of real numbers. 

5.1. Complexity of the Continued Fraction Expansion of an Algebraic 
Number. In 1949, Khinchin asked the following question: Are the partial quo- 
tients of the continued fraction expansion of a real non-quadratic irrational alge- 
braic number bounded? So far no example is known. It is not yet ruled out that 
all these partial quotients are bounded for all such x, or that they are unbounded 
for all such x. The common expectation seems to be that they are never bounded. 
The situation in finite characteristic is quite different. In 1976, Baum and Sweet 
[23] constructed a formal series which is cubic over the field F2{X), the continued 
fractions of which have partial quotients of bounded degree. 

We give here only a very short historical account. The very first transcendence 
results for numbers given by their continued fraction expansions are due to Liouville 
in 1844. This topic was extensively developed by Maillet in 1906 and later by Per- 
ron in 1929. In 1955, the Thue-Siegel-Roth theorem enabled Davenport and Roth 
to obtain deeper results. Further investigations are due to Baker in 1962 and 1964. 
The approximation results on real numbers by quadratic numbers due to Schmidt in 
1967 are a main tool for the next steps by Davison in 1989, and by Queffelec [SS] who 
established in 1998 the transcendence of the Prouhet-Thue-Morse continued frac- 
tion; see also [15l Section 13.7]. There are further papers by Liardet and Stambul 
in 2000 and by Baxa in 2004. In 2001, AUouche, Davison, Queffelec and Zamboni 
[T5] proved the transcendence of Sturmian continued fractions, namely continued 
fractions whose sequence of partial quotients is Sturmian. The transcendence of 
the Rudin-Shapiro and of the Baum-Sweet continued fractions was proved in 2005 
by Adamczewski, Bugeaud and Davison. In 2005, Adamczewski and Bugeaud [2] 
showed that the continued fraction expansion of an algebraic number of degree at 
least three cannot be generated by a binary morphism. 

5.2. The Fibonacci Continued Fraction. The Fibonacci word discussed in Sec- 
tion [2311] enabled Roy [59] to construct transcendental real numbers ^ for which ^ 
and are surprisingly well simultaneously and uniformly approximated by rational 
numbers. 

Recall once more that $ denotes the golden number, so that 

= $ - 1 = = 0.618 .... 

2 

Theorem 5.1 (Roy, 2003). Let A and B be two distinct positive integers. Let 
5 G (0, 1) be the real number whose continued fraction expansion is obtained from 
the Fibonacci word w by replacing the letters a and b by A and B respectively to 
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obtain 

[0; A, B, A, A, B, A, B, A, A, B, A, A, B, A, B, A, A, B, A, B, A, 
A, B, A, A, B, A, B, A, A, B,A,A,B.. .]. 

Then there exists c > such that the inequalities 

0<xo^X, |xo^-xiKcX-*"\ \xoe - X2\ ^ cX-"^'" 

have a solution in for all sufficiently large values of X . 

The arguments of Roy provide simplified proofs of Queffelec's results on the 
transcendence of the Thue-Morse and Fibonacci continued fractions. 

Using Theorem 15.11 together with ideas of Davenport and Schmidt involving a 
transference theorem concerning Mahler's convex bodies, Roy also produced tran- 
scendental numbers which are surprisingly badly approximated by cubic algebraic 
integers. 

Further results on Diophantine approximation of Sturmian continued fractions 
and on the simultaneous approximation of a number and its square have been ob- 
tained by Bugeaud and Laurent, Fischler, Roy, and more recently by Adamczewski 
and Bugeaud 6 . 

For further references on Diophantine approximation, we refer the reader to 
Bugeaud's recent book [30] , 

6. Open problems 

Many problems related to the present topic are open. To trace their original 
sources would require more thorough bibliographical investigation. While the ori- 
gins of these questions are most often much older, we give references to recent 
papers where they are quoted. 

Among the open questions raised in 2J is the following: 

• Does there exist an algebraic number of degree at least 3 whose continued 
fraction expansion is generated by a morphism? 

In the same vein the next question is open: 

• Do there exist an integer g ^ 3 and an algebraic number of degree at least 
3 whose expansion in base g is generated by a morphism? 

Two open questions, for which positive answers are expected, are proposed in [9]: 

• Is it true that the binary expansion of every algebraic number contains 
arbitrarily large squares? 

• Is it true that the binary expansion of every algebraic number contains 
arbitrarily large palindromes? 

Recall that a square is nothing other than a 2-power, namely a pattern XX where 
X is a word, while a palindrome is a word W — wiW2 . . .Wr which is invariant under 
reversal: if W denotes the word Wr ■ ■ ■ W2W1, then W — W. Hence a palindrome is 
either of the form XX or of the form XxX where X is a word and x a letter. 
Other problems are suggested by Rivoal [5S] : 

• Let g ^ 2 he an integer. Give an explicit example of a real number a; > 
which is simply normal in base g and for which 1/x is not simply normal 
in base g. 

• Let g ^ 2 he an integer. Give an explicit example of a real number a; > 
which is normal in base g and for which 1/x is not normal in base g. 

• Give an explicit example of a real number a; > which is normal and for 
which l/x is not normal. 
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Remark. In PSI, there is a construction of an automatic number, the inverse of 
which is not automatic. This answers by anticipation 15, Section 13.9, Problem 2]. 

From the open problems in T5J Section 13.9], we select the following two: 

• Show that the number log 2 is not 2-automatic; see ([T]). 

• Show that the number tt is not 2-automatic; see ([2|. 

We conclude this paper with one last open problem, attributed to Mahler; see 
for instance {¥: 

• Let (e„)„j.i be an infinite sequence over {0, 1} that is not ultimately peri- 
odic. Is it true that at least one of the two numbers 

J2 en2-" and ^ e„3"" 

is transcendental? 

From Conjecture 11.11 with g = 3 and a = 2, it follows that the second number 
should be always transcendental. 
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